Design and implementation of an efficient CNN accelerator for low-cost FPGAs
نویسندگان
چکیده
This paper proposes a computation-array-centered dataflow, which adjusts the convolution with different kernel sizes to unified computing manner and reduces dimension of computation array from 2D 1D, so as maximize utilization elements offered by accelerator. Furthermore, single unit multiple data (SUMD) strategy is proposed effectively alleviate mismatch between quantized hardware resources fixed bit width on FPGA. As case study, an 8-bit MobileNetV2 model has been implemented low-cost ZYNQ XC7Z020 FPGA, whose FPS/DSP GOPS/DSP achieve upto 0.55 0.35 respectively.
منابع مشابه
development and implementation of an optimized control strategy for induction machine in an electric vehicle
in the area of automotive engineering there is a tendency to more electrification of power train. in this work control of an induction machine for the application of electric vehicle is investigated. through the changing operating point of the machine, adapting the rotor magnetization current seems to be useful to increase the machines efficiency. in the literature there are many approaches wh...
15 صفحه اولnano-rods zno as an efficient catalyst for the synthesis of chromene phosphonates, direct amidation and formylation of amines
چکیده ندارد.
Efficient Edge Detection on Low-Cost FPGAs
Improving the efficiency of edge detection in embedded applications, such as UAV control, is critical for reducing system cost and power dissipation. Field programmable gate arrays (FPGA) are a good platform for making improvements because of their specialised internal structure. However, current FPGA edge detectors do not exploit this structure well. A new edge detection architecture is propos...
متن کاملdesign of an analog ram (aram)chip with 10-bit resolution and low-power for signal processing in 0/5m cmos process
برای پردازش سیگنال آنالوگ در شبکه های عصبی ، معمولا نیاز به یک واحد حافظه آنالوگ احساس میشود که بدون احتیاج به a/d وd/a بتواند بطور قابل انعطاف و مطمئن اطلاعات آنالوگ را در خود ذخیره کند. این واحد حافظه باید دارای دقت کافی ، سرعت بالا ، توان تلفاتی کم و سایز کوچک باشد و همچنین اطلاعات را برای زمان کافی در خود نگهدارد. برای پیاده سازی سیستمی که همه این قابلیتها را در خود داشته باشد، کوشش...
15 صفحه اولImplementing Efficient Low-Power PCIe Interfaces with Low-Cost FPGAs
A history of architectural and process advancements has enabled Altera® Cyclone® V FPGAs to be used in numerous low-cost and low-power applications in the industrial, automotive, military, communication and consumer markets, among others. This white paper outlines a real-life PCI Express® (PCIe®) Gen1x4 reference design including a DDR3 memory controller. It shows just how effective Cyclone V F...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Electronics Express
سال: 2022
ISSN: ['1349-2543', '1349-9467']
DOI: https://doi.org/10.1587/elex.19.20220370